[ENH] Add and Validate n_layers, n_units, activation & dropout_rate kwargs to MLPNetwork
#2338
Thanks for taking care of this.
In general:
- All parameters should be defined as private in the constructor before anything else is done
- The assertions and all the list checking should live only in the build_network method, not the constructor; this avoids issues on CI (see, for example, how the FCN network is implemented)
- Since we are parametrizing the network, the associated classifier and regressor should be parametrized the same way (docs included) and should pass these parameters when calling the network
- I left some other comments to check
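A minimal sketch of the pattern requested above, under stated assumptions: `SketchMLPNetwork` and its `_expand` helper are hypothetical stand-ins rather than aeon's actual `MLPNetwork`, the defaults for `n_layers` and `activation` are guesses, and no Keras layers are built. It reads the first two points as "the constructor only stores arguments; underscore-prefixed working copies and all list checks are created in `build_network`". `dropout_rate` would follow the same pattern and is left out for brevity.

```python
class SketchMLPNetwork:
    """Hypothetical stand-in used only to illustrate the review comments."""

    def __init__(self, n_layers=3, n_units=500, activation="relu"):
        # The constructor stores the arguments unchanged: no asserts,
        # no list expansion, so construction can never fail on bad input.
        self.n_layers = n_layers
        self.n_units = n_units
        self.activation = activation

    def _expand(self, value, name):
        # Hypothetical helper: broadcast a scalar to one entry per layer,
        # or check that a user-supplied list has exactly n_layers entries.
        if isinstance(value, list):
            if len(value) != self.n_layers:
                raise ValueError(
                    f"{name} has {len(value)} entries, expected {self.n_layers}"
                )
            return list(value)
        return [value] * self.n_layers

    def build_network(self):
        # All validation happens here; the private copies are derived
        # without mutating the public constructor arguments.
        self._n_units = self._expand(self.n_units, "n_units")
        self._activation = self._expand(self.activation, "activation")
        return self._n_units, self._activation


net = SketchMLPNetwork(n_layers=2, n_units=[500, 200])
units, activations = net.build_network()
```

With this split, a mismatched list such as `SketchMLPNetwork(n_layers=3, n_units=[500])` constructs without error and only raises `ValueError` once `build_network()` is called.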
ceea333 to 57a13fe
Looks great. One thing: the original default number of units is actually 500, not 200. Once this is changed we can get this in.
squash commit from aeon-toolkit#2338
Dropout rate(s) for each dense layer. If None, a default rate of 0.2 is used.
Dropout rate(s) are typically a number in the interval [0, 1].
As far as I can see, if dropout_rate is None, it will be set to
self._dropout_rate = [0.1]
self._dropout_rate.extend([0.2] * (self.n_layers - 1))
I think we should update this description to match.
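To make the mismatch concrete, here is a small standalone sketch of the two lines quoted in this comment. The function name `default_dropout_rates` is hypothetical and exists only for illustration; the body mirrors the quoted snippet with `self.n_layers` replaced by an argument.

```python
def default_dropout_rates(n_layers):
    # Mirrors the quoted snippet: 0.1 for the first layer,
    # then 0.2 for each of the remaining n_layers - 1 layers.
    rates = [0.1]
    rates.extend([0.2] * (n_layers - 1))
    return rates


print(default_dropout_rates(3))  # [0.1, 0.2, 0.2]
```

So the default is not a uniform 0.2: the first entry is 0.1, which is what the docstring would need to say.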
Closes #2337.